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Abstract 

For a graph G, let t(G) denote the maximum number of vertices in an induced subgraph of 
G that is a tree. In this paper, we study the problem of bounding t(G) for graphs which do not 
contain a complete graph K r on r vertices. This problem was posed twenty years ago by Erdos, 
Saks, and Sos. Substantially improving earlier results of various researchers, we prove that every 
connected triangle-free graph on n vertices contains an induced tree of order y/n. When r > 4, we 
also show that t(G) > for every connected i^-free graph G of order n. Both of these bounds 

are tight up to small multiplicative constants, and the first one disproves a recent conjecture of 
Matousek and Samal. 

1 Introduction 

For a graph G, let t(G) denote the maximum number of vertices in an induced subgraph of G that 
is a tree. The problem of bounding t{G) in a connected graph G was first introduced twenty years 
ago by Erdos, Saks, and Sos [5]. Clearly, to get a non-trivial result one must impose some conditions 
on the graph G, because, for example, the complete graph contains no induced tree with more than 

2 vertices. In their paper, Erdos, Saks, and Sos studied the relationship between t(G) and several 
natural parameters of the graph G. They were able to obtain asymptotically tight bounds on t(G) 
when either the number of edges or the independence number of G were known. 

Erdos, Saks, and Sos also considered the problem of estimating the size of the largest induced tree 
in graphs with no K r (complete graph on r vertices). Let t r (n) be the minimum value of t(G) over all 
connected iT r -free graphs Grain vertices. In particular, for triangle-free graphs, they proved that 

/ fog^\ < < O(V^logn), 

\ log log n / 

and left as an interesting open problem the task of closing the wide gap between these two bounds. 

The first significant progress on this question was made only recently by Matousek and Samal 
|15j . who actually came to the problem of estimating £3(71) from a different direction. Pultr had been 
studying forbidden configurations in Priestley spaces [2], and this led him to ask in [T7] how large t{G) 
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could be for connected bipartite graphs G. Let is(n) be the minimum value of t{G) over all connected 
bipartite graphs on n vertices. It is clear that t^{n) < ts(n), so the result of Erdos, Saks, and Sos 
immediately gives a lower bound on i_e(n). 

Motivated by Pultr's question, Matousek and Samal studied is(n) and (n). They found the 
following nice construction which shows that ts(n) < ts(n) < 2^/n+ 1. Let m = y/n, and consider the 
graph with parts V- m +i, V-m+2, ■ ■ ■ , V m -i, where \Vi\ = m — and each consecutive pair of parts 
(Vi, Vi+i) induces a complete bipartite graph. This graph is clearly bipartite with m 2 = n vertices, and 
it is easy to see that every induced tree in it has at most 2m — 1 vertices. On the other hand, Matousek 
and Samal were able to improve the lower bound on ts{n) and t%{n), showing that t^{n) > e Cv/log n for 
some constant c. Furthermore, they also proved that if there was even a single value of no for which 
^3( n o) < v/^Oj then in fact ts(n) < O(n^) for some constant (5 strictly below 1/2. The above fact 
led Matousek and Samal to conjecture that the true asymptotic behavior of t^{n) was some positive 
power of n which is strictly smaller than 1/2. 

Our first main result essentially solves this problem. It determines that the order of growth of 
both tz(n) and tfi(n) is precisely @(y/n), disproving the conjecture of Matousek and Samal. 

Theorem 1. Let G be a connected triangle-free graph on n vertices. Then t(G) > y/n. 

Furthermore, our approach can also be used to give asymptotically tight bounds on the size of the 
largest induced tree in i^ r -free graphs for all remaining values of r. In their original paper, Erdos, Saks, 
and Sos gave an elegant construction which shows that t r (n) for r > 4 has only logarithmic growth. 
Indeed, let T be a balanced (r — l)-regular tree, that is, a rooted tree in which all non-leaf vertices 
have degree r — 1 and the depth of any two leaves differs by at most 1. Then the line grapl0 L(T) is 
clearly K r -£ree, and one can easily check that induced trees in L(T) correspond to induced paths in 
T, which have only logarithmic length. Optimizing the choice of the parameters in this construction, 
one can show that t r (n) < ^j^^^ + 2. On the other hand, using Ramsey Theory, Erdos, Saks, and 

Sos also showed that t r (n) > ^g^g „ ? where c r is a constant factor depending only on r. Our second 
main result closes the gap between these two bounds as well, and determines the order of growth of 
t r (n) up to a small multiplicative constant. 

Theorem 2. Let r > 4, and let G be a connected graph on n vertices with no clique of size r. Then 
t(G) > 

One can also study induced forests rather than trees in K r -hee graphs. Let f r (n) be the maximum 
number such that every i^ r -free graph on n vertices contains an induced forest with at least f r (n) 
vertices. Trivially we have f r (n) > t r (n), but it appears that the size of the maximum induced forest 
in a graph is more closely related to another parameter. The independence number a(G) of a graph is 
the size of the largest independent set of vertices in G. Since an independent set is a forest and every 
forest is bipartite, the size of the largest induced forest in a graph G is at least a(G) and at most 
2a(G). Using the best known upper bound for off-diagonal Ramsey numbers [I], for fixed r > 3 and 

1 r-2 

all n we have f r {n) > cn r ~ 1 log r - 1 n for some positive constant c. Hence, f$(n) is larger than t$(n) by 
a factor of cylogn. Furthermore, for fixed r > 3, f r {n) and t r (n) behave very differently, as f r (n) is 

The vertices of L(T) are the edges of T, and two of them are adjacent if they share a vertex in T. 
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polynomial in n while t r (n) is only logarithmic. This demonstrates that in K r -fiee graphs the largest 
guaranteed induced forest is much larger than the largest guaranteed induced tree. 

We close this introduction by mentioning some related research. Our work considers the Ramsey- 
type problem of finding either a clique or a large induced tree. The similar problem of finding an 
induced copy of a particular tree T in a i^-free graph was independently raised by Gyarfas [9] and 
Sumner [20j. They conjectured that for any fixed integer r and tree T, any graph with sufficiently 
large chromatic number (depending on r and T) must contain either an r-clique or an induced copy 
of T. Note that the essential parameter for the graph G is now the chromatic number and not the 
number of vertices. Indeed, a complete bipartite graph has no clique of size 3, but contains only stars 
as induced subtrees. This conjecture is widely open, although some partial results were obtained in 

ma hd Hang. 

Induced trees were also studied in the context of sparse random graphs. This line of research 
was started by Erdos and Palka [4], who conjectured that for any constant c > 1, the random graph 
G(n,c/n) would with high probability contain an induced tree of order 7(c)n. This was solved by 
Fernandez de la Vega [6], and other variants of this result were obtained in [HI El 119] . In 
another regime, when the edge probability is p = clogn/n, Palka and Rucihski [16J showed that the 
largest induced tree in G(n,p) has size 0(nloglogn/logn) with high probability. 

The rest of this paper is organized as follows. In Section [2] we discuss the proof of Theorem [H and 
show how to reduce it to an abstract optimization problem on certain bipartite graphs with weights on 
the vertices. The solution of this problem is provided in the following section. In Section 01 we show 
how to extend our argument to the case of K r -free graphs with r > 4, and prove our second result, 
Theorem [2j The final section of the paper contains a few concluding remarks. Throughout our paper, 
we will omit floor and ceiling signs whenever they are not essential, to improve clarity of presentation. 

2 Triangle-free graphs 

The main idea in the proof of Theorem Q] is to use induction to prove a slightlj@ stronger statement. 
Instead of finding a single induced tree, we show that no matter which vertex v of the graph we choose, 
there exists a large induced tree which contains v. More precisely, we prove that any connected, 
triangle-free graph with n + 1 vertices contains an induced tree of size \fn + 1 through any given 
vertex. 

This is obviously true for n = 1, which serves as the base of our induction. It remains to prove the 
statement for general n > 2, while assuming its truth for all smaller values of n. So, let G = (V, E) 
be an arbitrary connected triangle- free graph with n + 1 vertices, and fix an arbitrary vertex v £ V. 
We will find a large induced tree through v. Note that since G is triangle- free, {v} U N(v) induces a 
star. Therefore, we may assume that the size of the neighborhood satisfies |-/V(f)| < y/n, or else we 
are done. 

Consider the subgraph of G induced by V\({v}L)N(v)). It decomposes into connected components, 
whose vertex sets we call Vx, . . . , V m . Now suppose that we could find a subset U C N(v), and a subset 
I C [to], with the following properties: 

We will discuss the relative strength of this statement in detail in our concluding remarks. 
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(i) For each i G I, there is exactly one u G U which is adjacent to at least one vertex in Let us 

denote this vertex by u(i). 

(ii) The sum Ylfei \/\V{\ is at least yj U . . . U V m \. 

Then, for each i £ I, we could apply the induction hypothesis to the connected subgraph of G induced 
by {u(i)} U Vi- This would give an induced tree Tj containing u(i), of size 1 + v/|Vj|- Furthermore, it 
is easy to see that the union of {v} with all of the above constructed trees Tj is also an induced tree. 
Indeed, since each V% is a maximal connected component, there are no edges between the Tj, and since 
G is triangle- free, there are no edges inside U C N[v). Therefore, we will have an induced tree with 
total size at least: 

IMI + IM*) + > i + i + Vl^iu...uy m | 

= 2 + V\V\({v}UN(v)\) 

> 2 + ^/(n + l)-l-v^ 

> 1 + Vn, 

as desired. Thus, the following abstract lemma completes the proof. 

Lemma 1. Consider a bipartite graph with sides A and B, with the property that each vertex in B 
has degree at least 1. Let each vertex i G B have an associated weight Wi > 0. We call a subset 
H CZ AU B admissible if each vertex v G B(~)H has exactly one neighbor in Af]H . Then there exists 
an admissible H with YlieBnH V^i — V^ieB w i- 

The connection between this lemma and our required selection of I C [m] and U C N(v) is clear. 
The sides A and B correspond to the sets N(v) and [m], respectively, and the weights W{ are precisely 
the sizes of the connected components \Vi\. The requirement that each vertex in B has degree at least 
1 is satisfied by the fact that G is connected, and so each component Vi has at least one neighbor in 
N{v). Therefore, this lemma will indeed complete the proof of Theorem [TJ 



3 Main lemma 

Before proving our main lemma, Lemma [H let us discuss an easy special case which we will actually 
need later in our study of K r -free graphs when r > 4. Observe that if the weights Wi in Lemma [Tj were 
roughly equal, then one way to control the objective Yli^BnH V™i wou ld be to find a lower bound on 
\B fl H\. This motivates the following claim, which we record for later use. 

Lemma 2. Consider a bipartite graph with sides A and B, with the property that each vertex in B 
has degree at least 1. We still call a subset H C A U B admissible if each vertex v G B n H has 
exactly one neighbor in A n H . Then there exists an admissible H with \BC\H\ > \f\B\ . 

Proof. The key observation, which we will also use in the proof of Lemma HJ is that we may assume 
that every v G A has some vertex w G B which is adjacent only to v. Indeed, suppose this is not 
the case, and every neighbor of v has additional neighbors in A. Then, deleting v from A will not 
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break the hypothesis of the lemma. Therefore, after repeatedly performing this reduction, we obtain 
a bipartite graph in which every vertex v £ A has a neighbor in B that sees only v. Notice that this 
implies that there is an induced matching between A and some subset B' C B. 

If | -A | > v/|-B|, then the induced matching immediately yields an admissible set H = AUB' which 
satisfies the assertion. On the other hand, when \A\ < \/\B\, there is a vertex in A with degree at least 
\f~B. Indeed, since every vertex in B has degree at least 1, the total number of edges in the bipartite 
graph is at least \B\, and therefore some vertex v G A has degree > |-B|/|A| > y/~B. The induced star 
H = {v} U N(v) provides the desired admissible set. □ 

We pause now to remark that Lemma [2] is far from being sharp. In fact, it is always possible to 
find an admissible H with \H n B\ > fi(]-B]/ log and this is tight. Although we do not need 
this result for our proof we sketch it here for the sake of completeness. By the reduction in the proof 
of Lemma [21 we may assume that there is an induced matching between A and a subset B' C B. 
In particular, this implies that all degrees in B are at most \A\ = \B'\ < \B\. The set of possible 
degrees {1, 2, . . . , \B\} is covered by the family of log 2 \B\ dyadic intervals = [2 fc ,2 fc+1 ], so there 
must be some Ik with the property that at least |U|/log 2 \ B\ vertices of B have degrees in Ik- Sample 
a random subset A' C A by taking each vertex independently with probability p = 2~ k ~ 1 , and let 
B" be the set of all vertices in B that are adjacent to exactly one vertex in A'. It is clear that 
H = A 1 U B" is admissible, so it remains to control \B"\. Any vertex v € B has probability exactly 
¥[Bm(d(v),2- k - 1 ) = 1] = d{v)2~ k ~ l {l - 2- fc - 1 ) rf M- 1 of being chosen for B". Since I k = [2 fc ,2 fc+1 ], 
when d(v) E Ik this probability is bounded from below by an absolute constant (one can take 1/8). 
Hence the expected size of B" is at least 0(|lfc|) > 0(|-B|/log \ B\), which implies that there must exist 
some choice of A' and B" that satisfy this bound. 

The following construction shows that this bound is asymptotically tight. Choose integers m = 2 k , 
let A = Z/mZ, and let B = B$ U . . . U B^, where each Bi = {bn, . . . ,bi m }. Let each bij be 
adjacent to precisely {i, i + 1, . . . , i + 2 J ' — 1} G A, where we reduce everything modulo m. This 
has \B\ = (k + l)m = G(mlogm), but it is not too difficult to verify that any admissible H has 
\HnB\< 2m. 

3.1 Proof of Lemma [1] 

Unfortunately, Lemma [2] is insufficient in general for our application, because in our triangle-free 
graph, the sizes of the connected components Vi of V \ ({v} U N(v)) may differ wildly. For this, we 
need its weighted variant, which we prove in this section. The main trick in the proof is to vary the 
weights, which leads us to study the following function. 

Definition 1. Let G be a bipartite graph with vertex set AD B. For notational convenience, let the 
vertices of B be named {1,2,..., m}. Then, we define 

F G (wi, . . . ,w m ) = max V] y/wi, 

admissible HcAUB £ — ' 

where we still say that a nonempty subset H C A U B is admissible when every vertex in B n H has 
exactly one neighbor in An H. 
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Lemma[T]is thus equivalent to the statement that Fq(w) > Wi for any collection of Wi > 0. 

Since this inequality is homogeneous in the Wi, from now on we will always assume that the weights 
have been normalized to sum to 1. It then suffices to show that Fq(vs) > 1 for all w that satisfy the 
constraints Wi > 0, w i = 1- Observe that this domain is now compact, and Fq is a maximum of a 
finite collection of continuous functions, hence continuous. Therefore, Fq attains its infimum on this 
domain, which we will denote mm^ Fq. 

So, suppose for the sake of contradiction that we have some graph G of minimum order for which 
min w Fq < 1. Graph G must have an induced matching between A and some subset B' C B, since 
otherwise we can use the same reduction argument as in the proof of Lemma[2]to obtain a contradiction 
to the minimality of G. Let (w\, . . . , w m ) be a minimizing assignment for Fq, satisfying the constraints 
Wi > and ^ W{ = 1. Note that actually all Wi must be strictly positive, or else we could delete a 
vertex i with w^ = to obtain a proper induced subgraph G' of G and a weight assignment w' for 
which Fq>(w') < 1, again contradicting the minimality of G. 

We now exploit the fact that we have cast our problem in a continuous setting. Let us study the 
effect of performing the following perturbation on the weights. 

Stage 1. For each i S B' , let w' i = wi — e-^/wJ. For i B' , let w[ = Wi. 

Stage 2. To compensate for the fact that ^w- = 1 — e^j 6 ^/ ^/wj < L renormalize by scaling up 
every weight by the same proportion. That is, for all % G B, let 



wi 



1 " e J2jeB> v 7 ^ 



Note that since all Wi > 0, for all sufficiently small e, all new w[ are still positive. This perturbation 
is chosen in the particular way because for small e and i E B', \JvJ i = y/wl— | +o(e). This is because 
it is easy to check that for every x > 0, 

lim V x ~ e V% ~ (V* ~ e / 2 ) = o 

So, the effect on the square root of each weight Wi with i S B' is roughly the same, no matter what 
the weight is. 

Now recall that the function Fq(wi, . . . ,w m ) is defined as the maximum over all admissible H of 
Siesnff Since all Wi > by definition, this is equal to the maximum over all maximal admissible 
H, where this maximality is defined with respect to set inclusion. For brevity, let M = Fq(w\, . . . , w m ) 
be that maximum and let H be any such maximal admissible selection. 

Note that H must intersect B' (the subset of B that has an induced matching to A). This is 
because any maximal admissible H contains at least one vertex from A, and that vertex's partner in 
B' can be added to H while preserving admissibility. In particular, the sum J2i £ BnH includes at 
least one downwardly perturbed weight from B'. Therefore, 

-| + o(e)<M-| + o(e). 
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The renormalization that converts w[ into w'- is particularly simple to analyze. Using the previous 
inequality and the observation that YLjeB' — M (because B' U A is an induced matching, hence 
admissible): 

M-4 + o(e) M-4 + o(e) 



Y Jw" < z • y = < 



The final bound is independent of H, so if it were strictly smaller than M, we would have 
Fq(w'[, . . . , w'm) < M = Fciwi, . . . , w m ), contradicting the minimality of (wi, . . . , w m ). Therefore, 
we must have: 



> M 



M-f +o(e) 

y/l-eM 
M- € - + o(e) > My/1 - eM 

M-~+o(e) > M(l-^ + o(eM) 



~2+o(e) > — + o{eM 2 ) 

l-o(l) < M 2 . 

In the final inequality, we used the fact that M is fixed, and therefore o(M 2 ) = o(l). Sending e to zero, 
we conclude that Fg(w) = M > 1. This contradicts our assumption that Fg(w) = min w < 1, so 
our proof is complete. □ 

Remark. The following example shows that the assertion of Lemma Q] no longer holds for any 
exponent a > 1/2. Indeed, consider the following bipartite graph. For some very large t, let A = 
{ai, . . . , a t }, let B = {bo, . . . , bt}, and connect each aj to &o an d &i- Let the weight of bo be 1 — t , 
and the weights of all other vertices in B be t~ 2 , so the total weight is 1. It is easy to see that the only 
maximal admissible sets in this graph are either a star containing bo and some other 6j, or the induced 
matching between A and B \ {bo}- Since a > 1/2 and t is sufficiently large, we have in the first case 
that (1 — t~ l ) a + (t~ 2 ) a = 1 — at~ l + o(t _1 ) < 1. On the other hand, for the second admissible set, 
we only have t ■ (t' 2 ) a = t l ~ 2a < 1. 



4 i^ r -free graphs 

This section is devoted to the proof of Theorem [2j The induction approach we used in Section [2] easily 
extends to the case of K r -free graphs when r > 4, and in fact the argument becomes even simpler. We 
prove that for any r > 4, every connected K r -iree graph G = (V, E) with n + 1 vertices contains an 
induced tree of size 4°^. + 1 through any particular vertex. Note that since the logarithm appears in 
both the numerator and denominator, its base is irrelevant. The statement is clearly true for n = 1, 
which starts our induction. 

Now, consider any n > 2, and suppose that the statement holds for all smaller values of n. Let 
v S V be an arbitrary vertex. We will find an induced tree of size 4°^. + 1 containing v. Recall the 
well-known fact from Ramsey Theory (see, e.g., chapter 6.1 of [3]) that any graph with a b > ( a ~^^~ l 2 ) 
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vertices contains either a clique of size a or an independent set of size b. This implies that the degree 
of v must be less than r lo s n / 41o g r — 77V 4 , or else we would already be done. Indeed, since G is -ff r -free, 
the neighborhood of v would then contain an independent set of size 4°^. • The vertices of this set 
together with v form an induced star of the desired size. The same argument also shows that every 
w £ N(v) has less than re 1//4 neighbors in V \ ({v} U N(v)). Otherwise, by the above discussion, we 
could find an independent set I C V \ ({v} U N(v)) of size j^p, all of whose vertices are adjacent to 
w. Then, v, w, and / will form a large induced tree containing v. 

Let Vi, . . . , V m be the vertex sets of the connected components of the subgraph of G induced by 
V \ ({v} U N(v)). Since G is connected, each Vi is adjacent to some vertex in N(v). As we explained 
above, each vertex in N(v) is adjacent to fewer than n 1//4 sets Vi, so in particular m < \N(v 
n 1 / 2 . We claim that all components Vi have size at most \. Indeed, suppose that some exceeds 
\. Let u be a vertex in N(v) which is adjacent to at least one vertex in Vi. Applying the induction 

hypothesis to {u} UVi, we find an induced tree T through u of size lo |j"^ - + 1 = ■ Then {v} U T 
gives an induced tree of the desired size. 

Next, we show that there are more than r 2 indices i for which \V%\ > Indeed, if this were not 
the case, then the total number of vertices in V would be less than: 



|WUiV(t;)| + ^|K;|<l + « 1/ Sm-^ + r 2 ^<l + n 1 / 4 + 2^.<l+n 1 / 4 + ^. 



This is less than n + 1 = \ V\ for all n > 2, which is a contradiction. 

Let B be the above set of indices for which \ Vi\ > -pr, and let A = N(v). Consider the auxiliary 
bipartite graph with sides A and B obtained by connecting u £ A with i G B if u is adjacent to at 
least one vertex in Vi. Applying Lemma [21 we find subsets A' C A and B' C B with \B'\ > W\B\ > r 
such that for each i G B' the component Vi is adjacent to exactly one vertex in A' C N(v), which we 
denote u(i). In fact, \B'\ > r + 1 since both \B'\ and r are integers. Apply the induction hypothesis 
to each {u(i)} U Vi to find an induced tree Tj containing u(i) of size at least ^fj^i + 1- 

If all u(i) are distinct, then we can find u(i) 7^ u(j) which are not adjacent in G, because this is a 
set of at least r + 1 vertices in a K r -fvee graph. Then, {v, u(i),u(j)} UTjU Tj is an induced tree. On 
the other hand, if there is some u(i) = u(j), then {v, u(i)} U Tj U Tj is an induced tree. In either case, 
we find an induced tree containing v of size at least 

log |K| t log\Vj\ ^ n , log(^/n/r 2 ) _ 1 ( logra 



\{v, u(i),u(j)}\ + + -^1 > 2 + 2 • " T v ' ' = 1 + 



41ogr 41ogr 41ogr 41ogr 

as desired. This completes the proof. □ 



5 Concluding remarks 

In this paper, we obtain a lower bound on the size of the largest induced-tree in a i^ r -free graph, 
which is tight up to a small multiplicative constant. Moreover, our proof shows that we can find a 
large tree through any particular vertex in the graph. It turns out that this seemingly stronger result 
is equivalent up to a constant factor to the original problem of finding one large tree. 
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Claim. Let T be an induced tree in a connected graph G, and let v be an arbitrary vertex. Then G 

ITI 

has an induced tree of size 1 + ^ which contains v. 

Proof. If v is already in T, then there is nothing to prove. Otherwise, let P = (t>i,t>2, • • • ,v m ) be 
a shortest path between v and T, with v\ = v and v m £ T. By minimality of P, there are no edges 
between {v\, . . . , f m -2} and T. Let ei, . . . , e& be the edges connecting v m -\ and T, and let ti,...,tk 
be their endpoints in T. Since T is a tree, by deleting some edges, we can partition it into subtrees T\, 
. . . , Tfc, such that each Tj contains tj. Consider the auxiliary graph on vertices, in which vertices i 
and j are adjacent if there is an edge of G between Tj and Tj. Note that this graph also forms a tree, 
and therefore is bipartite. Hence, we can find two disjoint subsets I U J = [k] such that the collection 
of Ti with i £ I has no edges crossing between them, and similarly, the collection of Tj with j £ J also 
has no crossing edges. Therefore, the union of {v\, . . . , f m -i} with either one of these two collections 
will form an induced tree. Clearly, both of these trees contain v, and their union covers T. Thus, one 

ITI 

of them has size at least 1 + ^ . □ 

We also wish to remark that for the problem of finding a large induced tree through every vertex 
of a triangle-free graph, one can improve the upper bound of Matousek and Samal. Indeed, 
consider the following triangle-free graph on n vertices. Let m = \/2n, and take the graph with 
parts Vq, . . . , V m -i, where Vq = {v}, every other \Vi\ = m — i, and each consecutive pair of parts 
(Vi, Vi+i) induces a complete bipartite graph. This is a bipartite (hence also triangle- free) graph with 
1 + m(m — l)/2 = (1 + o(l))n vertices, but one can easily check that any induced tree containing v 
has at most m = y/2n vertices. In particular, this shows that any approach which guarantees a large 
tree through every vertex of the graph cannot match Matousek and Samal's upper bound. 

In light of this discussion, we do not have a clear conjecture as to what is the right constant in 
front of \fn in the problem of finding a maximum induced tree in a triangle-free graph on n vertices. 
Nevertheless, as the upper and lower bounds are now so close, perhaps there is a hope to bridge this 
gap with other methods. 
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